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METHOD FOR DEPTH DETECTION IN 3D IMAGING PROVIDING A DEPTH 
MEASUREMENT FOR EACH UNITARY GROUP OF PIXELS 

BACKGROUND 

[0001] This is a continuation-in-part of Serial No. 09/660,809, filed on September 
13, 2000, entitled DIGITIZER USING INTENSITY GRADIENT TO IMAGE 
FEATURES OF THREE-DIMENSIONAL OBJECTS. 

Field of the Invention 

[0002] The invention relates to depth detection. More specifically, the invention 
relates to using individual pixels or elementary groups of pixels to determine 
distance from a reference. 

Background 

[0003] Various 3D imaging techniques exist for capturing three dimensional 
representations of three dimensional objects. To effect these captures it is necessary 
to determine depth from a reference point for each point represented. One way this 
has been done in the past is using a laser which sweeps over the target object. As the 
laser sweeps, the spot reflects off the object and strikes an image sensing array 
("ISA") where the spot strikes the ISA and indicates the depth of that point on the 
target from which it was reflected. Only a single depth measurement is captured for 
each captured period using this method. If we assume an ISA has 5,000 pixels, this 
implies a ratio of 1 to 5,000 in terms of depth measurements per pixel 
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[0004] Another common technique is to capture a projected pattern and interpret 
the distortion of the projected pattern. Using this technique, it is necessary to 
contextualize the distortion with the readings from adjacent pixels in order to 
identify and interpret a depth measurement for a particular location. While this 
technique improves the depth measurement per pixel ratio over the laser method 
discussed above, each measurement is dependent on the values captured by 
surrounding pixels and the ratio does not approach 1 to 1. 

[0005] Ultimately, the maximum resolution of a 3D image captured during a 
period of time is limited by the number of depth measurements that can be derived 
during that time period from the ISA. Maximum resolution and speed is achieved 
where each pixel provides a depth measurement for each capture period. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



[0006] The invention is illustrated by way of example and not by way of 
limitation in the figures of the accompanying drawings in which like references 
indicate similar elements. It should be noted that references to "an" or "one" 
embodiment in this disclosure are not necessarily to the same embodiment, and 
such references mean at least one. 

[0007] Figure 1 is a block diagram of a system of one embodiment of the 
invention. 

[0008] Figure 2 is a block diagram of a subsystem of one embodiment of the 
invention. 

[0009] Figure 3 shows a schematic diagram of a intensity gradient cone of one 
embodiment of the invention. 
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DETAILED DESCRIPTION 



[0010] Figure 1 is a block diagram of a system of one embodiment of the 
invention. The distributed network 100 such as the Internet provides an 
interconnection between a plurality of user nodes 110, a server node 120 and a host 
150. Server node 120 may be any conventional server or a collection of servers to 
handle traffic and requests over the distributed network. User nodes may be discrete 
computers running on their web browser, a corporate network, another server site, 
or any other node on the distributed network. Host 150 may be a computer (laptop, 
desktop, hand-held, server, workstation, etc.), an internet appliance or any other 
device through which data may be forwarded across the distributed network. 

[0011] The host 150 may communicate over a large wire link such as a universal 
serial bus (USB) or wireless link 162 to a digitizer 170. The digitizer 170 may be any 
of the myriad noncontact digitizers. One suitable digitizer is described in copending 
patent application Serial No. 09/660,809, entitled DIGITIZER USING INTENSITY 
GRADIENT TO IMAGE FEATURES OF THREE-DIMENSIONAL OBJECTS and 
assigned to the assignee of the instant application. 

[0012] In one embodiment, digitizer 170 is physically independent of an 
orientation fixture 180. For user convenience, it is desirable to minimize space 
permanently allocated to the system and minimize setup time. Most users will not 
be able to allocate sufficient space to leave the system configured for use at all times. 
The user will therefore be required to reintroduce some portion of the system prior 
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to each use. The need to swap cables and otherwise rewire serves as a significant 
deterrent to wide spread consumer adoption. 

[0013] As used herein, "physically independent" means that no mechanical or 
wired electrical connection must exist between the physically independent units 
during operation. By way of example and not limitation, two devices coupled 
together by an electrical signaling wire either directly or through a host computer, 
are not physically independent, whereas two devices that have no physical coupling 
and communicate over a wireless link are deemed "physically independent/' 
Connection to a common power source, e.g., two outlets in a house, is not deemed 
to destroy physical independence. 

[0014] Orientation fixture 180 repositions an object to be digitized by digitizer 170 
such that different aspects of the object are exposed relative to the digitizer at 
different points in time. In one embodiment the orientation fixture 180 is a 
turntable. One suitable turntable is described in copending application Serial No. 
09/660,810 entitled WIRELESS TURNTABLE and assigned to the assignee of the 
instant application. Orientation fixture may also be a robotic arm or other robotic 
device, or maybe a turntable in conjunction with a robotic arm or other robotic 
device. Other mechanisms that are capable of exposing different aspects of an object 
relative to the digitizer are deemed to be within the ambit of orientation fixtures. 

[0015] As previously noted the orientation fixture is physically independent of 
the digitizer. One premise of the system is relative ease of setup to facilitate wide 
acceptance. Thus, with the physical independence it is desirable that the digitizer 
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170 and orientation fixture 180 be able to "find" each other. To that end, the digitizer 
170 may be equipped to sweep an area looking with its sensing apparatus for a 
feature of the orientation fixture 180. The orientation fixture 180 may include a 
feature such as indicia, for example, acquisition indicia 188, or may contain some 
other physically observable structure that permits the digitizer to identify and 
acquire the orientation fixture 180 without the user introducing or removing a 
separate reference object. Acquiring the orientation fixture may permit, for 
example, any of automatic calibration of the digitizer, automatic determination of 
the relative position of the digitizer and orientation fixture, and fixture's orientation 
or condition. In one embodiment, imaging the feature provides an indication of 
focal distance as the perspective of the feature varies in a known way with distance. 
Calibration may be performed by imaging the feature and comparing the results to a 
set of reference data corresponding to the feature. In this manner the digitizer 
settings can be automatically optimized to provide the best available accuracy under 
existing conditions. Alternatively, the calibration can be performed based on a 
reference target or path entirely within the digitizer. 

[0016] Alternatively, the orientation fixture may have a localized radiation 
source 186, which permits the digitizer 170 to sweep and identify the location of the 
orientation fixture based on the localized radiation from radiation source 186. It is 
also within the scope and contemplation of the invention to have the orientation 
fixture 170 position itself relative to the digitizer, such that the orientation fixture 
controls the acquisition by the digitizer 170 of the orientation fixture 180 and the 
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object to be oriented thereby. In the system of such embodiment the orientation 
fixture would likely be a mobile robotic unit. 

[0017] In one embodiment, the digitizer communicates with the orientation 
fixture across a wireless link 184 to coordinate the orientation of the object with 
image capture by the digitizer. The wireless link may be infrared ("IR"), radio 
frequency ("RF") , optical signaling, or any other mode of wireless communication. 
In one embodiment the orientation fixture 180 includes a self contained power 
source 194 such as a battery. The self-contained power source 194 may also be a solar 
panel, fuel cell, or any other suitable power source. 

[0018] In one embodiment of the invention, digitizer 170 captures information 
about an object positioned by orientation fixture 180 from which a three- 
dimensional model can be derived. Controller 192 in digitizer 170 controls the 
coordination between the data capture by digitizer 170 and aspect change by the 
orientation fixture 180. It is within the scope and contemplation of the invention 
for the controller to reside in the host, the digitizer, the orientation fixture or in an 
independent unit. References to the controller herein are deemed to include 
without limitation all of these options. The digitizer 170 may also include a data 
analyzer 196 that reviews captured data to find errors, anomalies or other points of 
interest that warrant further investigation, including possibly rescanning the 
corresponding area. After any corrective action, the data captured by digitizer 170 is 
passed to the host 150, which renders the three-dimensional model from the data. 
The host 150 may perform compression or any other manipulation of the data 
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known in the art. The three-dimensional model may then be sent over distributed 
network 100 to remote nodes such as user nodes 110 or a server node 120. This 
provides maximum ease of distribution across the distributed network 100. 

[0019] In some cases, control of distribution of information captured by the 
digitizer is desirable, for example, to facilitate administration of user fees. To that 
end, in one embodiment the digitizer is provided with a hardware interlock 190 
which prevents the system from operating without first receiving authorization. 
Such authorization may be provided by the server node 120 sending authorization 
data across the distributed network. Alternative locking mechanisms such as 
software or firmware-based locking mechanisms may also be employed either 
within the digitizer 170 or the host 150. Further security of the system can be 
affected by requiring an imaging application 152 on the host 150 to provide a valid 
digital signature in addition to the authorization data before enabling capture 
and/or transfer of captured data from the digitizer 170 to the host 150. 

[0020] Some embodiments of the digitizer 170 may encrypt the data captured 
prior to sending it to the host 150. In that event, unless the host is able to decrypt 
the data to render it, it may forward it on to the server node 120 across the 
distributed network and subsequent rendering of the image or three-dimensional 
model would occur on the server node 120. In this manner, the local user does not 
have access to the data from which the three-dimensional model may be derived 
unless a key is provided. In still another embodiment, the host 150 may include 
encryption capabilities and encrypt the rendered image before forwarding it on to 
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the server node 120. Keying information may be provided to the digitizer and/ or 
the host by the server node 120. The server node may maintain keying information 
and authorization data in a local data base 122. Once the three-dimensional data is 
safely controlled by the server node 120, access to the data may be made available for 
free or at cost to the user nodes 110 or back to the host 150. 

[0021] The digitizer may also include a field programmable gate array ("FPGA") 
or other reconfigurable logic unit. In such case, the server node periodically may 
reprogram the FPGA to implement an updated or enhanced algorithm for 
processing or security purposes, for example, as subsequently developed. 

[0022] Figure 2 is a block diagram of a subsystem of one embodiment of the 
invention. The subsystem of Figure 3 may be inserted in place of host 150, digitizer 
120 and orientation fixture 180 of Figure 1. Digitizer 70 is coupled to a host 50. This 
coupling may be by a bus 60 such as the Universal Serial Bus (USB), IEEE 1394 bus, or 
any other suitable data transfer system. It is also within the scope and 
contemplation of the invention for the digitizer to communicate with the host 
mode via a wireless interconnection. Host 50 may be a personal computer, a work 
station, an internet appliance, or any other device that provides sufficient 
intelligence and processing power to render images from the data obtained by the 
digitizer. The digitizer 70 captures image data and may forward it to the host 50 for 
rendering. In this way, the processing on the digitizer 70 may be limited, permitting 
lower cost construction. It is also within the scope and contemplation of the 
invention for the digitizer to render the image and deliver it directly to a distributed 
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network. It is further within the scope and contemplation of the invention for the 
digitizer to deliver the data to a distributed network for rendering on a remote node. 

[0023] The digitizer 70 includes a projector to project a stripe of white light 
through a projection window 74 onto a remote object such as a person 82 on a 
turntable 80 remote from the digitizer. The digitizer also contains an image sensing 
array (ISA) aligned with an image capture window 76 which captures the image of 
the object 82 within a focal zone. In one embodiment, the ISA is a linear charge 
coupled device (CCD) or complementary metal oxide semiconductor (CMOS) sensor, 
and the focal zone is a line on the target object. In some embodiments, the digitizer 
includes a base 72 about which the upper unit, including the projector and the ISA, 
can rotate in either direction. This permits the focal line to be swept back and forth 
across a target object through an arc. This sweeping reduces the loss of detail in the 
captured image that results from shadowing on the object from the perspective of an 
immobile focal line. The digitizer 70 also includes a wireless interface to 
communicate with a turntable 80 via a wireless link 84. 

[0024] Turntable 80 may be the type described in co-pending application entitled 
WIRELESS TURNTABLE, Serial No. 09/660,810, assigned to the assignee of the 
instant application. Via wireless link 84, the digitizer sends commands to the 
turntable 80 and receives from the turntable indications of the angular position of 
the turntable surface relative to a home position. When the digitizer is activated, it 
searches for the turntable 80 by sending a signal to which the turntable 80 is required 
to respond. If the turntable responds, the digitizer looks for a predetermined pattern 
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that is expected to be present on the turntable surface. For example, the pattern may 
be concentric circles on the turntable surface. In such case, based on the image 
captured, the digitizer can both find the turntable and determine its distance from 
the digitizer. Then after the response is received, the digitizer sends a "go home" 
signal to the turntable. In some embodiments, the digitizer sends acceleration and 
rotation profiles to the turntable to control its rotation. Each profile may be retained 
in firmware on the digitizer or downloaded from host 50. 

[0025] Generally speaking, the projection portion of the digitizer 70 is retained in 
fixed relation to the imaging portion. The projection portion produces a light stripe 
as noted previously on the object 82. By either sweeping the light stripe back and 
forth through the focal line or by mechanically blocking the stripe at a known rate, 
the intensity gradient can be created. In one embodiment, the blocking is from 0% 
to 100% during a cycle. Because the ISA integrates the illumination over time, the 
outline of a three-dimensional surface is reflected in the data captured by the ISA. 
This is because protruding features will remain illuminated longer. Accordingly, 
more photons are captured by the ISA corresponding to those features. After 
repeating this process one stripe at a time as the object is rotated by turntable 80 or 
through the course of sweeping the entire digitizer back and forth as it rotates about 
the base, cost effective three-dimensional imaging is effected. 

[0026] In one embodiment, the system operates on the principle that depth data 
for a three-dimensional object may be calculated from an intensity difference 
resulting from an intensity gradient projected on the object. Existing image sensing 
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arrays (ISAs) such as linear charge coupled device (CCD) sensors can detect 
illumination intensity to a high degree of accuracy. Based on this principle, if a light 
source is placed in fixed relation to the ISA such that the projected light forms an 
angle with the focal line of the ISA r and a gradient slide, for example, going from 
dark to light, from left to right, is interposed between the light source and the object, 
features of the object closer to the ISA are illuminated by greater intensity light than 
those features further away. Thus, the ISA captures a stripe of the object in which 
different intensities represent different depths of the object in that focal zone. This 
general principle works well for uniformly colored objects imaged in an otherwise 
dark environment, but different coloring and ambient light conditions may cause 
misinterpretations of the intensity data. However, if the ISA images the same stripe 
of the object under ambient conditions (e.g., when the light source is not 
illuminating the object within the focal zone) and images again when the object is 
illuminated by a uniform light (e.g., with no gradient (flat gradient)), these possible 
misinterpretations can be avoided. 

[0027] Particularly, the ratio V G1 - VJ V G2 - V A yields a differential that can be 
mapped to depth of the object. In the differential, V G1 is the value from the ISA at a 
point resulting from the gradient exposure, V A is the value from the ambient 
exposure at that point, and V G2 is the value at the point from a second gradient 
exposure such as the uniform light (flat gradient) or a second gradient created as 
described further below. The differential is computed for each point in the focal 
zone. Moreover, this differential also normalizes the effect of color variations and 
ambient light conditions. Notably, the differential is also substantially independent 
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of intensity of the light source. Unfortunately, as a practical matter, changing slides 
and/or turning the light source on and off rapidly enough to permit digitization of 
many possible target objects is both expensive and problematic. 

[0028] However, by taking advantage of the fact that the ISA integrates over time, 
the same effect may be created mechanically using a shutter which causes 0% to 
100% of the light to illuminate the target object within the focal zone during the 
cycle. Moreover, by overdriving the shutter, the white light condition and ambient 
condition, can be created. Specifically, if the imaging time of the CCD is 5 
milliseconds, in an initial 5 milliseconds the shutter does not impinge on the light 
source, thereby allowing the imaging sensing array to image the fully illuminated 
object. The next 5 milliseconds, the shutter passes from 0 to 100% blockage of the 
light, thereby creating the intensity gradient within the focal zone. During the next 
5 milliseconds, the shutter continues to drive so that the light is entirely blocked 
and the ambient condition image is obtained. The processing of each of these 
images (including the creation of the differential) may be offloaded to an attached 
host as discussed in greater detail below. 

[0029] An intensity gradient may alternatively be created by sweeping the light 
through the focal zone. For example, by sweeping a light stripe from left to right 
through the focal zone, the ambient light image may be captured before the light 
enters the zone. A first gradient is captured from the first entry of the light into the 
zone until the light is entirely within the zone. A second gradient is captured as a 
light translates out of the zone to the right. The second gradient is the opposite of 



004956.P006X 



13 



Express Mail No. EL651820479US 



the first gradient and is not flat as in the fully illuminated case. An analogous set of 
images may be captured as the light sweeps back from left to right. One advantage of 
sweeping the light is that two gradients are generated as the light moves from right 
to left and two gradients are generated as the light moves from left to right. Thus, 
the sweeping can be performed at half speed without a reduction in imaging 
performance. 

[0030] The differential may take the same form as discussed above. 
Alternatively, the differential may be computed as XJiX^ X 2 ), where X a = V G1 - V A 
and X 2 = V G2 -V A . To reduce noise sensitivity, the larger magnitude gradient should 
be selected for the numerator of the ratio. Color intensity is given by X 1 + X 2 . 

[0031] Figure 3 shows a schematic diagram of a intensity gradient cone of one 
embodiment of the invention. This example has been simplified to include only a 
single capture of one gradient as might be applicable to uniformly shaped and 
colored objects in a dark environment. This example, of course, could be expanded 
to cover non-uniform cases as discussed above using intensity differentials. A light 
source is located a distance L along a line normal to a line of sight of a linear ISA. 
The light source projects a gradient having a minimum intensity I A and maximum 
intensity I B . The angle 0 A corresponds to the angle between the minimum intensity 

edge of the projected gradient and the line normal to the ISA line of sight. 9 B 
corresponds to the angle defined by the maximum intensity edge of the projected 
gradient and the normal line. 9 corresponds to the angle defined by a line from the 
gradient origin to the point on the target for which distance is to be determined and 
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the normal line. Between I A and I B/ intensity is proportioned to the angle 0 and is 
given by 1(0) = (I B - I A ) [(9 - 9 J/ (9 B - 9 A )] + I A . 



[0032] Thus, angle as a function of intensity is given by 9(1) = (9 B - 9 A ) * [(I - I A )/ 

(I B - I A )] + 0 A . The distance D is the given by D = L * tan 9. Therefore, distance as a 

function of intensity measured at the ISA is given by D = L * tan {(0 B - 0 A ) * [(I - I A )/ 

(I B - I A )] + 9 A }. In one embodiment, the light source is fixed relative to the ISA and 
the minimum and maximum intensities are known or can be calibrated at run 
time. This allows intensity derived during capture to be mapped easily to distance 
with limited computational complexity. 

[0033] In one embodiment, depth measurements are determined on a pixel by 
pixel basis. Because intensity captured by the single pixel is independent of intensity 
captured by other pixels and intensity maps directly to depth, it is possible to achieve 
one depth measurement per pixel. Moreover, it would be possible to obtain a depth 
measurement from an image sensor having a single sensing element. In an 
alternative embodiment, adjacent groups of pixels may be treated as an element 
with the aggregate captured intensity used to determine the depth measurement 
without regard to an energy distribution among the pixels in the group. This yields 
one measurement per elementary group of pixels. 

[0034] As indicated, it is possible to calculate the depth data directly from the 
intensity information. However, the speed and processing power required are 
reduced when a lookup table (LUT) based on a prior calibration is employed to 
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derive depth data based on the differentials. This also allows nonidealities of the 
physical equipment to be accounted for in the selected LUT entity, based on the prior 
calibration. Accordingly, the embodiment of the invention maintains a LUT and 
indexes into the LUT based on the differential. 

[0035] In the foregoing specification, the invention has been described with 
reference to specific embodiments thereof. It will, however, be evident that various 
modifications and changes can be made thereto without departing from the broader 
spirit and scope of the invention as set forth in the appended claims. The 
specification and drawings are, accordingly, to be regarded in an illustrative rather 
than a restrictive sense. 
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